Ε-isa: an Incremental Lower Bound Approach for Efficiently Finding Approximate Nearest Neighbor of Complex Vague Queries

نویسندگان

  • Tran Khanh DANG
  • Josef KÜNG
  • Roland WAGNER
چکیده

In our context, a complex vague query means a multifeature nearest neighbor query. Answering such queries requires the system to search on some feature spaces individually and then combine the searching results to find the final answers. The feature spaces are commonly multidimensional spaces and may consist of a vast amount of data. Therefore searching costs, including IO-cost and CPU-cost, are prohibitively expensive for complex vague queries. For only such a single-feature space, to alleviate the costs, problem of answering nearest neighbor and approximate nearest neighbor queries has been proposed and quite well-addressed in the literature. A data object P is called a (1+ε)approximate nearest neighbor of a given query object Q with ε>0 if for all other data objects P’: dist(P, Q) (1+ε)dist(P’, Q), in which dist(X, Y) represents the distance between objects X and Y. In this paper, however, we introduce an approach for finding (1+ε)approximate nearest neighbor(s) of complex vague queries, which must deal with the problem on multiple feature spaces. This approach is based on a novel, efficient and general algorithm called ISA-Incremental hyper-Sphere Approach, which has just recently been introduced for solving nearest neighbor problem in the Vague Query System (VQS). To the best of our knowledge, the work presented in this paper is one of a few vanguard solutions for dealing with problem of answering approximate multi-feature nearest neighbor queries. The experimental results with both uniformly distributed and real data sets will prove the efficiency of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Space-Time Tradeoffs for Proximity Searching in Doubling Spaces

We consider approximate nearest neighbor searching in metric spaces of constant doubling dimension. More formally, we are given a set S of n points and an error bound ε > 0. The objective is to build a data structure so that given any query point q in the space, it is possible to efficiently determine a point of S whose distance from q is within a factor of (1 + ε) of the distance between q and...

متن کامل

Probabilistic Voronoi Diagrams for Probabilistic Moving Nearest Neighbor Queries

Article history: Received 9 November 2010 Received in revised form 4 February 2012 Accepted 6 February 2012 Available online 21 February 2012 A large spectrum of applications such as location based services and environmental monitoring demand efficient query processing on uncertain databases. In this paper, we propose the probabilistic Voronoi diagram (PVD) for processing moving nearest neighbo...

متن کامل

Optimal Approximate Polytope Membership

In the polytope membership problem, a convex polytope K in R is given, and the objective is to preprocess K into a data structure so that, given a query point q ∈ R, it is possible to determine efficiently whether q ∈ K. We consider this problem in an approximate setting and assume that d is a constant. Given an approximation parameter ε > 0, the query can be answered either way if the distance...

متن کامل

(Approximate) Conic Nearest Neighbors and the induced Voronoi Diagram

For a given point set in Euclidean space we consider the problem of finding (approximate) nearest neighbors of a query point but restricting only to points that lie within a fixed cone with apex at the query point. Apart from being a rather natural question to ask, solutions to this problem have applications in surface reconstruction and dimension detection. We investigate the structure of the ...

متن کامل

Approximate line nearest neighbor in high dimensions

We consider the problem of approximate nearest neighbors in high dimensions, when the queries are lines. In this problem, given n points in R, we want to construct a data structure to support efficiently the following queries: given a line L, report the point p closest to L. This problem generalizes the more familiar nearest neighbor problem. From a practical perspective, lines, and low-dimensi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008